Software tools and test data for research and testing of page-reading OCR systems
Internal identifier: 001373 (Main/Exploration); previous: 001372; next: 001374
Authors: Thomas A. Nartker [United States]; Stephen V. Rice [United States]; Steven E. Lumos [United States]
Source:
- SPIE proceedings series [1017-2653]; 2005.
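French descriptors
- Pascal (Inist): Outil logiciel, Appareil lecture, Reconnaissance optique caractère, Disponibilité, Traitement image document, Evaluation performance, Algorithme, Collecte donnée, Précision, Vérification programme
English descriptors
- KwdEn: Accuracy, Algorithm, Availability, Data gathering, Document image processing, Optical character recognition, Performance evaluation, Program verification, Reading device, Software tool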
Abstract
We announce the availability of the UNLV/ISRI Analytic Tools for OCR Evaluation together with a large and diverse collection of scanned document images with the associated ground-truth text. This combination of tools and test data will allow anyone to conduct a meaningful test comparing the performance of competing page-reading algorithms. The value of this collection of software tools and test data is enhanced by knowledge of the past performance of several systems using exactly these tools and this data. These performance comparisons were published in previous ISRI Test Reports and are also provided. Another value is that the tools can be used to test the character accuracy of any page-reading OCR system for any language included in the Unicode standard. The paper concludes with a summary of the programs, test data, and documentation that is available and gives the URL where they can be located.
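To illustrate what "character accuracy" can mean in practice, here is a minimal Python sketch. It assumes a common definition: the fraction of ground-truth characters that remain after subtracting the minimum number of character insertions, deletions, and substitutions needed to turn the OCR output into the ground truth. This record does not describe the algorithm used by the UNLV/ISRI tools, so the function names `edit_distance` and `character_accuracy` and the formula itself are illustrative assumptions, not the tools' actual interface.

```python
def edit_distance(truth: str, ocr: str) -> int:
    """Levenshtein distance between two strings, counted in Unicode code points.

    Hypothetical helper for illustration; not part of the UNLV/ISRI tools.
    """
    # prev[j] holds the distance between the processed prefix of `truth`
    # and the first j characters of `ocr`.
    prev = list(range(len(ocr) + 1))
    for i, t in enumerate(truth, start=1):
        curr = [i]
        for j, o in enumerate(ocr, start=1):
            cost = 0 if t == o else 1
            curr.append(min(prev[j] + 1,          # character missing from OCR output
                            curr[j - 1] + 1,      # extra character in OCR output
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def character_accuracy(truth: str, ocr: str) -> float:
    """Assumed definition: (n - errors) / n, with n = len(truth)."""
    if not truth:
        raise ValueError("ground-truth text must not be empty")
    errors = edit_distance(truth, ocr)
    return (len(truth) - errors) / len(truth)


if __name__ == "__main__":
    # One substituted character out of four ground-truth characters.
    print(character_accuracy("abcd", "abxd"))  # 0.75
```

Because Python strings are sequences of Unicode code points, the same sketch applies to text in any script, which matches the language-independence claim in the abstract. Note that the value can become negative when the OCR output contains many spurious characters.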
Links to previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000454
- to stream PascalFrancis, to step Curation: 000334
- to stream PascalFrancis, to step Checkpoint: 000380
- to stream Main, to step Merge: 001410
- to stream Main, to step Curation: 001373
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Software tools and test data for research and testing of page-reading OCR systems</title>
<author><name sortKey="Nartker, Thomas A" sort="Nartker, Thomas A" uniqKey="Nartker T" first="Thomas A." last="Nartker">Thomas A. Nartker</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Rice, Stephen V" sort="Rice, Stephen V" uniqKey="Rice S" first="Stephen V." last="Rice">Stephen V. Rice</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Lumos, Steven E" sort="Lumos, Steven E" uniqKey="Lumos S" first="Steven E." last="Lumos">Steven E. Lumos</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">05-0361379</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 05-0361379 INIST</idno>
<idno type="RBID">Pascal:05-0361379</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000454</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000334</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000380</idno>
<idno type="wicri:doubleKey">1017-2653:2005:Nartker T:software:tools:and</idno>
<idno type="wicri:Area/Main/Merge">001410</idno>
<idno type="wicri:Area/Main/Curation">001373</idno>
<idno type="wicri:Area/Main/Exploration">001373</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Software tools and test data for research and testing of page-reading OCR systems</title>
<author><name sortKey="Nartker, Thomas A" sort="Nartker, Thomas A" uniqKey="Nartker T" first="Thomas A." last="Nartker">Thomas A. Nartker</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Rice, Stephen V" sort="Rice, Stephen V" uniqKey="Rice S" first="Stephen V." last="Rice">Stephen V. Rice</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Lumos, Steven E" sort="Lumos, Steven E" uniqKey="Lumos S" first="Steven E." last="Lumos">Steven E. Lumos</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Information Science Research Institute (ISRI) University of Nevada, Las Vegas</s1>
<s2>Las Vegas, NV 89154-4021</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Nevada</region>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
<imprint><date when="2005">2005</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Accuracy</term>
<term>Algorithm</term>
<term>Availability</term>
<term>Data gathering</term>
<term>Document image processing</term>
<term>Optical character recognition</term>
<term>Performance evaluation</term>
<term>Program verification</term>
<term>Reading device</term>
<term>Software tool</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Outil logiciel</term>
<term>Appareil lecture</term>
<term>Reconnaissance optique caractère</term>
<term>Disponibilité</term>
<term>Traitement image document</term>
<term>Evaluation performance</term>
<term>Algorithme</term>
<term>Collecte donnée</term>
<term>Précision</term>
<term>Vérification programme</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">We announce the availability of the UNLV/ISRI Analytic Tools for OCR Evaluation together with a large and diverse collection of scanned document images with the associated ground-truth text. This combination of tools and test data will allow anyone to conduct a meaningful test comparing the performance of competing page-reading algorithms. The value of this collection of software tools and test data is enhanced by knowledge of the past performance of several systems using exactly these tools and this data. These performance comparisons were published in previous ISRI Test Reports and are also provided. Another value is that the tools can be used to test the character accuracy of any page-reading OCR system for any language included in the Unicode standard. The paper concludes with a summary of the programs, test data, and documentation that is available and gives the URL where they can be located.</div>
</front>
</TEI>
<affiliations><list><country><li>États-Unis</li>
</country>
<region><li>Nevada</li>
</region>
</list>
<tree><country name="États-Unis"><region name="Nevada"><name sortKey="Nartker, Thomas A" sort="Nartker, Thomas A" uniqKey="Nartker T" first="Thomas A." last="Nartker">Thomas A. Nartker</name>
</region>
<name sortKey="Lumos, Steven E" sort="Lumos, Steven E" uniqKey="Lumos S" first="Steven E." last="Lumos">Steven E. Lumos</name>
<name sortKey="Rice, Stephen V" sort="Rice, Stephen V" uniqKey="Rice S" first="Stephen V." last="Rice">Stephen V. Rice</name>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001373 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001373 | SxmlIndent | more
To add a link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= Pascal:05-0361379 |texte= Software tools and test data for research and testing of page-reading OCR systems }}
This area was generated with Dilib version V0.6.32.